Knowledge-Based Cascade-Correlation
Abstract
Neural network modeling typically ignores the role of knowledge in learning by starting from random weights. A new algorithm extends cascade-correlation by recruiting previously learned networks as well as single hidden units. Knowledge-based cascade-correlation (KBCC) finds, adapts, and uses its relevant knowledge to speed learning. In this paper, we describe KBCC and illustrate its performance on a small but clear problem.

1 Existing knowledge and new learning

Most research on learning in neural networks has assumed that learning is done "from scratch", without the influence of previous knowledge. However, it is clear that when people learn, they make extensive use of their existing knowledge [1-3]. Use of prior knowledge in learning is responsible for the ease and speed with which people are able to learn new material, and for interference effects. A major limitation of neural network models of human cognition and learning is that these networks begin learning from only a random set of connection weights. This implements a tabula rasa view of each learning task that few contemporary researchers would accept.

In this paper, we propose a fundamental extension of cascade-correlation (CC), a generative learning algorithm that has been useful in the simulation of cognitive development [4-9]. CC builds its own network topology by recruiting new hidden units into a feed-forward network as needed in order to reduce network error [10]. Our extension, called knowledge-based cascade-correlation (KBCC), recruits previously learned networks in addition to the untrained hidden units recruited by CC. We refer to existing networks as potential source knowledge and to a current learning task as a target. Previously learned source networks compete with each other and with single hidden units to be recruited into the target network.
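The competition between source networks and single hidden units can be pictured with a small sketch. It assumes the usual cascade-correlation criterion, in which a candidate is scored by how strongly its output covaries with the residual error of the target network; this section does not state the scoring rule, so treat it as an assumption. The function names, the division by the number of candidate outputs, and the toy data are all hypothetical and chosen only for illustration.

```python
# Sketch only: the scoring rule and all names here are assumptions for
# illustration, not definitions taken from the paper.

def covariance_score(candidate_outputs, residual_errors):
    """Score one candidate against the residual error of the target network.

    candidate_outputs: one row per pattern, one value per candidate output
    residual_errors:   one row per pattern, one value per network output
    Returns the summed |covariance| over all (candidate output, network output)
    pairs, divided by the number of candidate outputs (assumed normalization so
    that multi-output networks are not favored merely for having more outputs).
    """
    n_patterns = len(candidate_outputs)
    n_cand_out = len(candidate_outputs[0])
    n_net_out = len(residual_errors[0])
    mean_v = [sum(row[j] for row in candidate_outputs) / n_patterns
              for j in range(n_cand_out)]
    mean_e = [sum(row[k] for row in residual_errors) / n_patterns
              for k in range(n_net_out)]
    total = 0.0
    for j in range(n_cand_out):
        for k in range(n_net_out):
            # Unnormalized covariance summed over patterns.
            cov = sum(
                (candidate_outputs[p][j] - mean_v[j])
                * (residual_errors[p][k] - mean_e[k])
                for p in range(n_patterns)
            )
            total += abs(cov)
    return total / n_cand_out


def recruit(candidates, residual_errors):
    """Return the name of the candidate with the highest score."""
    return max(candidates,
               key=lambda name: covariance_score(candidates[name], residual_errors))


# Toy data: four patterns, one network output.
errors = [[0.5], [-0.5], [0.5], [-0.5]]
candidates = {
    "single_unit": [[0.1], [0.1], [0.2], [0.2]],                      # one output
    "source_net": [[1.0, 0.0], [0.0, 0.0], [1.0, 0.0], [0.0, 0.0]],   # two outputs
}
print(recruit(candidates, errors))
```

In this toy case the source network's first output tracks the error exactly while the single unit's output is uncorrelated with it, so the source network wins the competition.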
KBCC is similar to recent neural network research on transfer [11], sequential learning [12], lifelong learning [13], multi-tasking [14], knowledge insertion [15], modularity [16], and input recoding [17], but it tries to accomplish these functions by storing and searching for knowledge within a unified generative network approach.

2 Description of KBCC

KBCC is similar to CC, except that KBCC treats previously learned networks like single candidate hidden units, in that they are all candidates for recruitment into a target network. A candidate unit and a candidate network both describe a differentiable function. The connection scheme for a sample KBCC network is shown in Figure 1. This scheme is similar to that in CC except that a recruited network can have multiple weighted sums as inputs and multiple outputs, whereas a single recruited unit only has one weighted sum as input and has a single output.

Among the notational conventions we use in formulating KBCC are:

$w_{o_u,o}$: Weight between output $o_u$ of unit $u$ and output unit $o$.
$w_{o_u,i_c}$: Weight between output $o_u$ of unit $u$ and input $i_c$ of candidate $c$.
$f'_{o,p}$: Derivative of the activation function of output unit $o$ with respect to its input at pattern $p$.
$\nabla_{i_c} f_{o_c,p}$: Partial derivative of candidate $c$ output $o_c$ with respect to its input $i_c$ at pattern $p$.
$V_{o,p}$: Activation of output unit $o$ at pattern $p$.
$V_{o_c,p}$: Activation of output $o_c$ of candidate $c$ at pattern $p$.
$V_{o_u,p}$: Activation of output $o_u$ of unit $u$ at pattern $p$.
$T_{o,p}$: Target value of output $o$ at pattern $p$.
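The difference in connection scheme between a recruited unit and a recruited network can be sketched as follows. This is a minimal illustration under stated assumptions: the tanh activation, the function names, and the toy two-input, two-output source function are hypothetical and do not come from the paper. The upstream activations play the role of $V_{o_u,p}$ and the weights the role of $w_{o_u,i_c}$.

```python
import math


def unit_candidate(weights, upstream):
    """Single hidden unit: one weighted sum as input, one output.

    weights:  one weight per upstream output
    upstream: activations of the upstream units for one pattern
    """
    weighted_sum = sum(w * v for w, v in zip(weights, upstream))
    return [math.tanh(weighted_sum)]  # tanh is an assumed activation


def network_candidate(weight_rows, upstream, source_net):
    """Recruited network: one weighted sum per source input, several outputs.

    weight_rows: one row of weights per input i_c of the source network
    source_net:  any function mapping a list of inputs to a list of outputs
    """
    inputs = [sum(w * v for w, v in zip(row, upstream)) for row in weight_rows]
    return source_net(inputs)


def toy_source(x):
    """Hypothetical previously learned network with 2 inputs and 2 outputs."""
    return [math.tanh(x[0] + x[1]), math.tanh(x[0] - x[1])]


# One pattern: three upstream activations feeding each candidate.
upstream = [1.0, 0.5, -0.5]
unit_out = unit_candidate([0.2, -0.4, 0.6], upstream)
net_out = network_candidate([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]], upstream, toy_source)
print(len(unit_out), len(net_out))
```

Both candidates expose the same interface (upstream activations in, a list of outputs out), which is what lets them compete on equal terms; only the number of weighted sums and outputs differs.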
Similar resources
Neural Network Simulations by Cascade Correlation and Knowledge-Based Cascade Correlation Networks
Cascade correlation (CC) has proven to be an effective tool for simulating human learning. One important class of problem solving tasks can be thought of as establishing appropriate connections between inputs and outputs. A CC network initially attempts to solve the task with a minimal network configuration, but when the task cannot be solved, it is powered up by recruiting a hidden unit to cap...
Phase-I monitoring of standard deviations in multistage linear profiles
In most modern manufacturing systems, products are often the output of some multistage processes. In these processes, the stages are dependent on each other, where the output quality of each stage depends also on the output quality of the previous stages. This property is called the cascade property. Although there are many studies in multistage process monitoring, there are fewer works on prof...
Modeling Acquisition of a Torque Rule on the Balance-scale Task
We present a new model of development of children’s performance on the balance-scale task, one of the most common benchmarks for computational modeling of development. Knowledge-based cascade-correlation (KBCC) networks progress through all four stages seen in children, ending with a genuine torque rule that can solve problems only solvable by comparing torques. A key element in the model is in...
Fuzzy PD Cascade Controller Design for Ball and Beam System Based on an Improved ARO Technique
The ball and beam system is one of the most popular laboratory setups for control education. In this paper, we design a fuzzy PD cascade controller for a ball and beam system using Asexual Reproduction Optimization (ARO) technique. The ball & beam system consists of a servo motor, a grooved beam, and a rolling ball. This system utilizes a servo motor to control ball’s position on the beam. Chan...
A new approach based on state conversion to stability analysis and control design of switched nonlinear cascade systems
In this paper, the problems of control and stabilization of switched nonlinear cascade systems are investigated. The so-called simultaneous domination limitation (SDL) was introduced in previous works to assure the existence of a common quadratic Lyapunov function (CQLF) for switched nonlinear cascade systems. According to this idea, if all subsystems of a switched system satisfy the SDL, a CQLF ...